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We propose a novel approach to the recognition of particular classes of non-conventional events 
in signals from phase-sensitive optical time-domain-reflectometry-based sensors. Our algorithmic 
solution has two main features: filtering aimed at the de-nosing of signals and a Gaussian mixture 
model to cluster them. We test the proposed algorithm using experimentally measured signals. 
The results show that two classes of events can be distinguished with the best-case recognition 
probability close to 0.9 at sufficient numbers of training samples. 


The problem of the development of novel sensor tech¬ 
niques plays a crucial role in science and technology. One 
of the most important classes of sensing systems is dis¬ 
tributed sensors, which are of great importance for the 
remote control of extended objects [1-8]. Phase-sensitive 
coherent optical time-domain reflectometry (d>-OTDR) 
is a basic technique that can provide sufficient sensitiv¬ 
ity and resolution for these distributed sensing systems 
[4-6] . Standard OTDR techniques use light sources with 
coherence lengths, which are shorter than pulse lengths. 
This can yield a sum of backscattered intensities from 
each scattering center, which allows one, e.g., to control 
splices and breaks in fiber cables [8]. On the contrary, 
in $~OTDR-based sensing systems [9-14] , the coherence 
length of lasers is longer than their pulse length. An event 
near the optical fiber generates an acoustic wave that af¬ 
fects the fiber by changing the phases of the backscat- 
tering centers. An analysis of such signals can reveal 
their impact on the sensor and monitor located near fiber 
objects [9-14]. A key stage in implementing dJ-OTBR- 
based sensors is the development of an algorithmic solu¬ 
tion to reveal unusual vibrations in the background. 

The problem of the recognition of non-conventional ac¬ 
tivity (a target event) consists of two closely related sub¬ 
problems. The first is related to the de-noising procedure, 
which allows the detection of an event in the background 
with high probability. The second and much more im¬ 
portant subproblem is the development of a classification 
methods aimed at clustering detected target events into 
predetermined classes. Due to the complex structure of 
the signals in such sensors, this is challenging [15-23]. 

In addition to guaranteeing high accuracy of recogni¬ 
tion, post-processing algorithms for 4>-OTDR-based sen¬ 
sors should be able to operate rapidly without significant 
computational resources. In other cases, their applica¬ 
tion in real-time distributed fiber optic sensing systems is 
substantially limited. In vibration sensing systems based 
on $-OTDR-based, there is at present no sufficiently 
fast and versatile algorithmic solution for the recogni¬ 
tion of events. A promising direction for the solution of 
this problem is using of a machine learning toolbox, in 


particular neural networks and pattern recognition tech¬ 
niques [20-23]. Recent results [21-23] have shown that 
recognition algorithms based on machine learning yield 
a classification accuracy close to one with a reasonably 
small recognition time. 

In this study, we employ techniques of machine learn¬ 
ing for the recognition of target events in 4>-OTDR-based 
distributed fiber-optic vibration sensor. The dJ-OTBR 
based sensing setup was used to collect experimental data 
for this study (for details, see [24-26]). As a potential ap¬ 
plication of the system, we have in mind the problem of 
access control for protected regions, with the fiber be¬ 
ing located along their perimeter. The system generates 
field distributions I {I, St) corresponding to time interval 
St. These raw data are obtained from the setup in blocks 
with ~10^ reflectograms, which are the result of mea¬ 
surements in one second. The input for post-processing 
is the block containing ^10^ reflectograms. 

The proposed detection procedure commences with the 
de-noising of measured signals It{l) by the median filter 
in the time domain of the dimension three. The proce¬ 
dure is intended to reduce the self-noises of the system. 
The structure of the self-noise resembles that of random 
noise, for the removal of which median filtering is an ef¬ 
fective technique [27]. 

In the next stage, we use another type of filtering. 
This is based on the 3-700 Hz band-pass filter to re¬ 
move the DC component of the noise as well as noise at 
high frequencies. The boundaries for the band-pass filter 
are adaptive, i.e., they have been optimized based on an 
analysis of the experimental data. Thus, we obtain the 
de-noised signal 

Following filtering, for every point-space step Ij , we cal¬ 
culate the standard deviation with a window of 20-50 
ms. The parameters of the time window were optimized 
according to the set of experimental data. However, the 
set of data with <j{lj) for every j was very large. Hence, 
we used a thinning procedure, where we split vector a{lj) 
on a region of 100 counts, calculate the maximum value 
for these regions, and form a new vector consisting 

of these maximum values. 
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Then, we can detect events based on whether thresh¬ 
old Xc is exceeded. This value is calculated adaptively 
with respect to the concrete set of experimental data. 
In fact, following the employment of the procedure de¬ 
scribed above, the detection of potential target events 
reduces to a search for local centers of mass in the space- 
time field distribution X{l,t), which exceeds the critical 
value. For our specific set of experimental data, the criti¬ 
cal value could be chosen as 10-100 of the standard devia¬ 
tion of reflectograms in the last 15 minutes. For our case, 
this critical value was X^ — 10®. It was calculated on the 
basis of an analysis of the instantaneous values of the sig¬ 
nals, which guaranteed events, and that of the mean val¬ 
ues of low noise points (only technical self-noises). That 
is, signals that did not contain non-conventional events 
and had a minimal effect on seismic noise had mean val¬ 
ues of X ~ 10® and standard deviations of ax — 10®, 
and those signifying high noise points (seismic noises on 
top of self-noises) had mean values of Xnoise — 10® and 
o'x noise — 10^. This yields efficient detection. 

For an event, we allocate a region of the signal contain¬ 
ing it and other events in its vicinity in the time domain. 
We are interested in the case where we have to separate 
two specific classes of events: single-target passage, and 
digging near the cable. 

On the one hand, it is important to note that our de¬ 
tection procedure allows us to efficiently locate events. 
On the other, important properties that can serve as 
benchmarks for recognition are lost during this proce¬ 
dure. That is why we separate the stages, and why input 
data for classification is once again constituted by raw 
measured reflectograms 

The first step is the formation of a feature space for the 
machine learning tool aimed at classification of detected 
events. For this purpose, we use cepstral coefficients [28], 
which are used in machine learning toolboxes aimed on 
recognition of complex acoustic signals [29, 30]. Here, 
we briefly describe their calculation for signal It{l) of 
discrete form in time domain /;[!], where 0 < t < N. It 
is important to note that for the formation of the feature 
space, we used a raw space-time field distribution. 

We apply to the signal Ii [t] the Fourier transform 

Yi[k]=^^^ Q Ii[t]G'>cp[—2Trikt/N], 0<k<N. (1) 


Using the window function, we employ the following se¬ 
quence of filters 
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where the frequency / [to] can be obtained from the fol¬ 
lowing relation 
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Here, M is the number of Mel coefficients, Fs=fd/2=1 
kHz is the Nyquist sampling rate, fd is the sampling fre¬ 
quency, and B{x) is the transformation to the Mel scale: 
H“®(a:)=700(exp [x/1125] — 1), where the numerical co¬ 
efficient corresponds to the most common formula for 
transformation to the Mel scale [29]. 

We then calculate the energy for every window (2) 


S[m] \Yi[k]f-Hmik)^ ■ (4) 

By applying the discrete cosine transformation to (4), 
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( 5 ) 


with 0 < n < M, we obtain the set of the cepstral coef¬ 
ficients in the Mel scale. 

The second step of the algorithm is the assignment of 
an event to classes. We use the Gaussian mixture model 
(GMM) for this. The GMM is a probabilistic model. The 
main assumption underlying the GMM is that all data 
points are generated from a mixture of a finite number 
of Gaussian distributions with unknown parameters. In 
our case, there are two set of data points with a Gaussian 
distribution. The feature space for the GMM is formed 
by the cepstral coefficients. We found that the optimal 
value of the cepstral coefficients was M = 10. 

The workflow of the GMM is as follows: The input 
data for classification is a vector K of cepstral coefficients 
of the signal. In the GMM framework [30], K is inter¬ 
preted as representing independent observations from a 
mixture of two multivariate normal distributions which, 
in fact, form the feature space for classification. Two nor¬ 
mal distributions are then generated, where A/'(/ii,Ei) 
corresponds to the input vector belonging to the first 
class and M{fi 2 ,Y, 2 ) corresponds to that belonging to 
the second class. Furthermore, the goal is to estimate 
unknown parameters representing the “mixing” value be¬ 
tween these normal distributions. After this, the GMM 
works on the basis of the expectation-maximization algo¬ 
rithm [31]. Thus, the algorithm computes for each point 
a probability of being generated by each component of 
the model. Consequently, one can tweak the parameters 
to maximize the likelihood of the data given those assign¬ 
ments. Repeating this process guarantees convergence to 
a local optimum. In terms of implementation, we used 
the GMM library described in the details in Ref. [32]. 

The collection of experimental data was organized as 
follows: We are interested in two types of events: single¬ 
target passage, and digging near the cable. During the 
experiment, we obtained 3.93 GB of raw data (2.51 GB 
corresponded to digging samples, and 1.42 GB to passage 
samples). We then collected 58 single-target passages at 
a distance of 0-5 m from the cable and 118 digging events 
at 0-10 m from it under similar conditions. The sug¬ 
gested algorithmic solution was executed on a standard 
PC with an Intel Core i5 M 460 processor and 8 GB of 
RAM. The average time taken by the classification stage 
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Tr. samples 

Test, samples 

Correct 

Correct, % 

35 

141 

103 

65,96 

44 

132 

97 

73,48 

52 

124 

103 

83,06 

61 

115 

93 

80,87 

70 

106 

91 

85,85 

79 

97 

77 

79,38 

88 

88 

82 

93,18 

96 

80 

66 

82,5 

105 

71 

63 

88,73 

114 

62 

56 

90,32 

123 

53 

50 

94,34 

132 

44 

38 

86,36 

140 

36 

32 

88,89 


TABLE I. Parameters of the test for the experimental verifi¬ 
cation of the proposed algorithm: number of training samples 
(tr. samples) for the GMM, number of test samples (test, sam¬ 
ples), number of correctly clustered events and percentage. 

of the algorithm was approximately 1 ms, which is rea¬ 
sonable in light of recently reported results. Therefore, 
the proposed algorithm is sufficiently fast to implement 


in real-time distributed fiber optic vibration sensing sys¬ 
tems to control protected areas. 

The estimated signal-to-noise ratio (SNR) in the col¬ 
lected experimental data was approximately 8.5 dB [33]. 
The effect of changes in SNR in the region on recogni¬ 
tion was sufficiently small in this case. The probability 
of recognition, which is defined as the ratio of successful 
recognition of events to all input events, with a varying 
number of training samples (Le., samples used for the for¬ 
mation of clusters) and testing samples is listed in Table 
I. Our results showed that the two classes of events can 
be detected and distinguished with the best-case proba¬ 
bility close to 0.9 using the proposed method. However, 
as it clearly seen from the table the probability strongly 
depends on number and properties of testing samples. 
These issues are interesting to be studied in future. 
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